Dataset statistics
| Number of variables | 17 |
|---|---|
| Number of observations | 1000 |
| Missing cells | 8666 |
| Missing cells (%) | 51.0% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 132.9 KiB |
| Average record size in memory | 136.1 B |
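The two derived figures in this table can be checked by hand. The sketch below assumes that the missing-cell percentage is taken over all cells (variables × observations) and that the average record size is the total size divided by the row count; the variable names are illustrative only.

```python
# Figures copied from the overview table above.
n_variables = 17
n_observations = 1000
missing_cells = 8666
total_size_kib = 132.9

# Every observation contributes one cell per variable.
total_cells = n_variables * n_observations

# Share of all cells that are empty, as a percentage.
missing_pct = 100 * missing_cells / total_cells

# Convert KiB to bytes, then spread over the rows.
avg_record_bytes = total_size_kib * 1024 / n_observations

print(f"Missing cells: {missing_pct:.1f}%")
print(f"Average record size: {avg_record_bytes:.1f} B")
```

Both values round to the figures shown in the table (51.0% and 136.1 B).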
Variable types
| Numeric | 17 |
|---|---|
Alerts
| ForestDensityL is highly correlated with ForestDensityM and 5 other fields | High correlation |
|---|---|
| ForestDensityM is highly correlated with ForestDensityL and 4 other fields | High correlation |
| ForestDensityS is highly correlated with ForestDensityL and 1 other field | High correlation |
| NoisePollutionRailwayL is highly correlated with NoisePollutionRailwayM and 3 other fields | High correlation |
| NoisePollutionRailwayM is highly correlated with NoisePollutionRailwayL and 3 other fields | High correlation |
| NoisePollutionRailwayS is highly correlated with NoisePollutionRailwayL and 1 other field | High correlation |
| NoisePollutionRoadL is highly correlated with Floor and 8 other fields | High correlation |
| NoisePollutionRoadM is highly correlated with ForestDensityL and 7 other fields | High correlation |
| NoisePollutionRoadS is highly correlated with NoisePollutionRoadL and 3 other fields | High correlation |
| PopulationDensityL is highly correlated with ForestDensityL and 4 other fields | High correlation |
| PopulationDensityM is highly correlated with ForestDensityL and 5 other fields | High correlation |
| living_area is highly correlated with Plot_area and 1 other field | High correlation |
| rooms_combined is highly correlated with living_area | High correlation |
| Floor is highly correlated with NoisePollutionRoadL | High correlation |
| Plot_area is highly correlated with living_area | High correlation |
| 0 has 506 (50.6%) missing values | Missing |
| Floor has 818 (81.8%) missing values | Missing |
| ForestDensityL has 494 (49.4%) missing values | Missing |
| ForestDensityM has 494 (49.4%) missing values | Missing |
| ForestDensityS has 494 (49.4%) missing values | Missing |
| NoisePollutionRailwayL has 494 (49.4%) missing values | Missing |
| NoisePollutionRailwayM has 494 (49.4%) missing values | Missing |
| NoisePollutionRailwayS has 494 (49.4%) missing values | Missing |
| NoisePollutionRoadL has 494 (49.4%) missing values | Missing |
| NoisePollutionRoadM has 494 (49.4%) missing values | Missing |
| NoisePollutionRoadS has 494 (49.4%) missing values | Missing |
| Plot_area has 883 (88.3%) missing values | Missing |
| PopulationDensityL has 494 (49.4%) missing values | Missing |
| PopulationDensityM has 494 (49.4%) missing values | Missing |
| living_area has 517 (51.7%) missing values | Missing |
| rooms_combined has 508 (50.8%) missing values | Missing |
| Floor has 34 (3.4%) zeros | Zeros |
| ForestDensityL has 33 (3.3%) zeros | Zeros |
| ForestDensityM has 126 (12.6%) zeros | Zeros |
| ForestDensityS has 308 (30.8%) zeros | Zeros |
| NoisePollutionRailwayL has 321 (32.1%) zeros | Zeros |
| NoisePollutionRailwayM has 373 (37.3%) zeros | Zeros |
| NoisePollutionRailwayS has 437 (43.7%) zeros | Zeros |
Reproduction
| Analysis started | 2023-01-16 15:50:04.055067 |
|---|---|
| Analysis finished | 2023-01-16 15:50:28.406568 |
| Duration | 24.35 seconds |
| Software version | pandas-profiling v3.3.0 |
| Download configuration | config.json |
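The reported duration follows directly from the two timestamps. A small check using only the Python standard library (the format string and variable names are illustrative):

```python
from datetime import datetime

# Timestamp layout used in the Reproduction table.
FMT = "%Y-%m-%d %H:%M:%S.%f"

started = datetime.strptime("2023-01-16 15:50:04.055067", FMT)
finished = datetime.strptime("2023-01-16 15:50:28.406568", FMT)

# Elapsed wall-clock time between start and finish, in seconds.
duration_s = (finished - started).total_seconds()

print(f"Duration: {duration_s:.2f} seconds")
```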
df_index
Real number (ℝ≥0)
| Distinct | 983 |
|---|---|
| Distinct (%) | 98.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 11145.117 |
| Minimum | 3 |
|---|---|
| Maximum | 21984 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 995.55 |
| Q1 | 5767.75 |
| median | 10869 |
| Q3 | 16701.75 |
| 95-th percentile | 21018.35 |
| Maximum | 21984 |
| Range | 21981 |
| Interquartile range (IQR) | 10934 |
Descriptive statistics
| Standard deviation | 6435.110413 |
|---|---|
| Coefficient of variation (CV) | 0.5773928091 |
| Kurtosis | -1.189647511 |
| Mean | 11145.117 |
| Median Absolute Deviation (MAD) | 5479.5 |
| Skewness | 0.01199401887 |
| Sum | 11145117 |
| Variance | 41410646.03 |
| Monotonicity | Not monotonic |
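Several of the descriptive statistics above are derived from others in the same tables. The sketch below recomputes them for df_index from the reported values, assuming the usual definitions (CV = standard deviation / mean, variance = standard deviation squared, IQR = Q3 - Q1, range = maximum - minimum).

```python
# Values copied from the df_index tables above.
mean = 11145.117
std = 6435.110413
q1, q3 = 5767.75, 16701.75
minimum, maximum = 3, 21984

cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance is the squared standard deviation
iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # spread between the extremes

print(f"CV       = {cv:.10f}")
print(f"Variance = {variance:.2f}")
print(f"IQR      = {iqr}")
print(f"Range    = {value_range}")
```

The results agree with the reported CV, variance, IQR and range up to rounding of the inputs.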
| Value | Count | Frequency (%) |
| 9476 | 2 | 0.2% |
| 20420 | 2 | 0.2% |
| 5564 | 2 | 0.2% |
| 20840 | 2 | 0.2% |
| 7756 | 2 | 0.2% |
| 1288 | 2 | 0.2% |
| 5920 | 2 | 0.2% |
| 2367 | 2 | 0.2% |
| 19889 | 2 | 0.2% |
| 7138 | 2 | 0.2% |
| Other values (973) | 980 | 98.0% |
| Value | Count | Frequency (%) |
| 3 | 1 | 0.1% |
| 8 | 1 | 0.1% |
| 62 | 1 | 0.1% |
| 63 | 1 | 0.1% |
| 216 | 1 | 0.1% |
| 226 | 1 | 0.1% |
| 231 | 1 | 0.1% |
| 232 | 1 | 0.1% |
| 245 | 1 | 0.1% |
| 311 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 21984 | 1 | 0.1% |
| 21930 | 1 | 0.1% |
| 21909 | 1 | 0.1% |
| 21907 | 1 | 0.1% |
| 21899 | 1 | 0.1% |
| 21897 | 1 | 0.1% |
| 21883 | 1 | 0.1% |
| 21880 | 1 | 0.1% |
| 21850 | 1 | 0.1% |
| 21845 | 1 | 0.1% |
0
| Distinct | 293 |
|---|---|
| Distinct (%) | 59.3% |
| Missing | 506 |
| Missing (%) | 50.6% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1301541.154 |
| Minimum | 13480 |
|---|---|
| Maximum | 18500000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 13480 |
|---|---|
| 5-th percentile | 276500 |
| Q1 | 544250 |
| median | 856500 |
| Q3 | 1390000 |
| 95-th percentile | 3974750 |
| Maximum | 18500000 |
| Range | 18486520 |
| Interquartile range (IQR) | 845750 |
Descriptive statistics
| Standard deviation | 1599910.921 |
|---|---|
| Coefficient of variation (CV) | 1.229243437 |
| Kurtosis | 37.44928437 |
| Mean | 1301541.154 |
| Median Absolute Deviation (MAD) | 388000 |
| Skewness | 5.021103783 |
| Sum | 642961330 |
| Variance | 2.559714956 × 10¹² |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 690000 | 10 | 1.0% |
| 720000 | 8 | 0.8% |
| 790000 | 7 | 0.7% |
| 495000 | 6 | 0.6% |
| 1050000 | 6 | 0.6% |
| 990000 | 6 | 0.6% |
| 1250000 | 6 | 0.6% |
| 1290000 | 5 | 0.5% |
| 890000 | 5 | 0.5% |
| 540000 | 5 | 0.5% |
| Other values (283) | 430 | 43.0% |
| (Missing) | 506 | 50.6% |
| Value | Count | Frequency (%) |
| 13480 | 1 | 0.1% |
| 100000 | 1 | 0.1% |
| 110000 | 2 | 0.2% |
| 128000 | 1 | 0.1% |
| 135000 | 1 | 0.1% |
| 145000 | 1 | 0.1% |
| 149000 | 1 | 0.1% |
| 160000 | 1 | 0.1% |
| 165000 | 1 | 0.1% |
| 175000 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 18500000 | 1 | 0.1% |
| 12900000 | 1 | 0.1% |
| 9900000 | 2 | 0.2% |
| 9490000 | 1 | 0.1% |
| 8000000 | 1 | 0.1% |
| 7000000 | 1 | 0.1% |
| 6500000 | 1 | 0.1% |
| 5900000 | 4 | 0.4% |
| 5500000 | 1 | 0.1% |
| 5450000 | 1 | 0.1% |
Floor
| Distinct | 9 |
|---|---|
| Distinct (%) | 4.9% |
| Missing | 818 |
| Missing (%) | 81.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.214285714 |
| Minimum | 0 |
|---|---|
| Maximum | 999 |
| Zeros | 34 |
| Zeros (%) | 3.4% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 1 |
| median | 2 |
| Q3 | 2 |
| 95-th percentile | 4.95 |
| Maximum | 999 |
| Range | 999 |
| Interquartile range (IQR) | 1 |
Descriptive statistics
| Standard deviation | 73.93616191 |
|---|---|
| Coefficient of variation (CV) | 10.2485769 |
| Kurtosis | 181.8604159 |
| Mean | 7.214285714 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 13.48302891 |
| Sum | 1313 |
| Variance | 5466.556038 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1 | 54 | 5.4% |
| 2 | 52 | 5.2% |
| 0 | 34 | 3.4% |
| 3 | 24 | 2.4% |
| 4 | 8 | 0.8% |
| 5 | 6 | 0.6% |
| 7 | 2 | 0.2% |
| 999 | 1 | 0.1% |
| 8 | 1 | 0.1% |
| (Missing) | 818 | 81.8% |
| Value | Count | Frequency (%) |
| 0 | 34 | 3.4% |
| 1 | 54 | 5.4% |
| 2 | 52 | 5.2% |
| 3 | 24 | 2.4% |
| 4 | 8 | 0.8% |
| 5 | 6 | 0.6% |
| 7 | 2 | 0.2% |
| 8 | 1 | 0.1% |
| 999 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 999 | 1 | 0.1% |
| 8 | 1 | 0.1% |
| 7 | 2 | 0.2% |
| 5 | 6 | 0.6% |
| 4 | 8 | 0.8% |
| 3 | 24 | 2.4% |
| 2 | 52 | 5.2% |
| 1 | 54 | 5.4% |
| 0 | 34 | 3.4% |
ForestDensityL
| Distinct | 385 |
|---|---|
| Distinct (%) | 76.1% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.20616829 |
| Minimum | 0 |
|---|---|
| Maximum | 0.7818335524 |
| Zeros | 33 |
| Zeros (%) | 3.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0.02856369354 |
| median | 0.1332614169 |
| Q3 | 0.3345537874 |
| 95-th percentile | 0.624348801 |
| Maximum | 0.7818335524 |
| Range | 0.7818335524 |
| Interquartile range (IQR) | 0.3059900938 |
Descriptive statistics
| Standard deviation | 0.2129651927 |
|---|---|
| Coefficient of variation (CV) | 1.032967741 |
| Kurtosis | -0.1676070279 |
| Mean | 0.20616829 |
| Median Absolute Deviation (MAD) | 0.1187016948 |
| Skewness | 0.9996778791 |
| Sum | 104.3211547 |
| Variance | 0.04535417329 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 33 | 3.3% |
| 0.7287375493 | 7 | 0.7% |
| 0.001463765284 | 5 | 0.5% |
| 0.5293040392 | 4 | 0.4% |
| 0.6201703484 | 4 | 0.4% |
| 0.2626010286 | 3 | 0.3% |
| 0.03159867475 | 3 | 0.3% |
| 0.02855305306 | 3 | 0.3% |
| 0.02200244445 | 3 | 0.3% |
| 0.3624860387 | 3 | 0.3% |
| Other values (375) | 438 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 33 | |
| 0.0005804289896 | 1 | 0.1% |
| 0.0008519206946 | 1 | 0.1% |
| 0.001439866706 | 2 | 0.2% |
| 0.001463765284 | 5 | 0.5% |
| 0.001480421026 | 1 | 0.1% |
| 0.001526740644 | 1 | 0.1% |
| 0.001598434195 | 1 | 0.1% |
| 0.0016738081 | 1 | 0.1% |
| 0.001675944725 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.7818335524 | 2 | 0.2% |
| 0.7703403707 | 2 | 0.2% |
| 0.73528662 | 1 | 0.1% |
| 0.7317917222 | 1 | 0.1% |
| 0.729199132 | 1 | 0.1% |
| 0.7287375493 | 7 | |
| 0.6976655763 | 2 | 0.2% |
| 0.6961794294 | 1 | 0.1% |
| 0.6595866014 | 2 | 0.2% |
| 0.6562600723 | 1 | 0.1% |
ForestDensityM
| Distinct | 309 |
|---|---|
| Distinct (%) | 61.1% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1395802141 |
| Minimum | 0 |
|---|---|
| Maximum | 0.7873727282 |
| Zeros | 126 |
| Zeros (%) | 12.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 3.628899293 × 10⁻⁶ |
| median | 0.03940907754 |
| Q3 | 0.2394775828 |
| 95-th percentile | 0.5315413164 |
| Maximum | 0.7873727282 |
| Range | 0.7873727282 |
| Interquartile range (IQR) | 0.2394739539 |
Descriptive statistics
| Standard deviation | 0.1902572063 |
|---|---|
| Coefficient of variation (CV) | 1.363067162 |
| Kurtosis | 0.7582242843 |
| Mean | 0.1395802141 |
| Median Absolute Deviation (MAD) | 0.03940907754 |
| Skewness | 1.367035369 |
| Sum | 70.62758832 |
| Variance | 0.03619780453 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 126 | 12.6% |
| 0.6376854901 | 7 | 0.7% |
| 0.3640016896 | 4 | 0.4% |
| 0.3527967296 | 4 | 0.4% |
| 0.3337455987 | 3 | 0.3% |
| 0.0002160058529 | 3 | 0.3% |
| 0.006021573685 | 3 | 0.3% |
| 0.3871355314 | 3 | 0.3% |
| 0.08778971087 | 3 | 0.3% |
| 8.920410497 × 10⁻⁶ | 3 | 0.3% |
| Other values (299) | 347 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 126 | |
| 3.602476365 × 10⁻⁶ | 1 | 0.1% |
| 3.708168076 × 10⁻⁶ | 2 | 0.2% |
| 7.582025223 × 10⁻⁶ | 1 | 0.1% |
| 8.920410497 × 10⁻⁶ | 3 | 0.3% |
| 1.28428226 × 10⁻⁵ | 1 | 0.1% |
| 1.328139803 × 10⁻⁵ | 1 | 0.1% |
| 3.541161623 × 10⁻⁵ | 2 | 0.2% |
| 5.282093802 × 10⁻⁵ | 1 | 0.1% |
| 0.0001419647445 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.7873727282 | 2 | 0.2% |
| 0.7065104182 | 2 | 0.2% |
| 0.684303994 | 2 | 0.2% |
| 0.6614440924 | 1 | 0.1% |
| 0.659423634 | 1 | 0.1% |
| 0.6376854901 | 7 | |
| 0.6212207448 | 1 | 0.1% |
| 0.610363301 | 1 | 0.1% |
| 0.5996403899 | 1 | 0.1% |
| 0.596064267 | 1 | 0.1% |
ForestDensityS
| Distinct | 149 |
|---|---|
| Distinct (%) | 29.4% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.08885116357 |
| Minimum | 0 |
|---|---|
| Maximum | 0.7699277131 |
| Zeros | 308 |
| Zeros (%) | 30.8% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.07102366901 |
| 95-th percentile | 0.4956389601 |
| Maximum | 0.7699277131 |
| Range | 0.7699277131 |
| Interquartile range (IQR) | 0.07102366901 |
Descriptive statistics
| Standard deviation | 0.1707718521 |
|---|---|
| Coefficient of variation (CV) | 1.921999051 |
| Kurtosis | 2.870959633 |
| Mean | 0.08885116357 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 1.994489453 |
| Sum | 44.95868877 |
| Variance | 0.02916302547 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 308 | |
| 0.4898974575 | 19 | 1.9% |
| 0.5559857975 | 7 | 0.7% |
| 0.2503022174 | 4 | 0.4% |
| 0.00096677693 | 4 | 0.4% |
| 0.01415137706 | 3 | 0.3% |
| 0.151862973 | 3 | 0.3% |
| 0.2796496856 | 3 | 0.3% |
| 0.1465924172 | 2 | 0.2% |
| 0.06257414312 | 2 | 0.2% |
| Other values (139) | 151 | 15.1% |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 308 | |
| 0.0002872407762 | 1 | 0.1% |
| 0.0004468745144 | 1 | 0.1% |
| 0.0004771991623 | 1 | 0.1% |
| 0.00096677693 | 4 | 0.4% |
| 0.001321255177 | 1 | 0.1% |
| 0.001564598797 | 1 | 0.1% |
| 0.001629041449 | 1 | 0.1% |
| 0.004111235764 | 1 | 0.1% |
| 0.004231411001 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.7699277131 | 2 | 0.2% |
| 0.7479678248 | 1 | 0.1% |
| 0.6988524424 | 1 | 0.1% |
| 0.6220103268 | 2 | 0.2% |
| 0.5918542718 | 1 | 0.1% |
| 0.5793579672 | 1 | 0.1% |
| 0.5694486392 | 1 | 0.1% |
| 0.5559857975 | 7 | |
| 0.5430213941 | 1 | 0.1% |
| 0.5330776338 | 2 | 0.2% |
NoisePollutionRailwayL
| Distinct | 154 |
|---|---|
| Distinct (%) | 30.4% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01155228602 |
| Minimum | 0 |
|---|---|
| Maximum | 0.1282648481 |
| Zeros | 321 |
| Zeros (%) | 32.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0.01076803394 |
| 95-th percentile | 0.05986415372 |
| Maximum | 0.1282648481 |
| Range | 0.1282648481 |
| Interquartile range (IQR) | 0.01076803394 |
Descriptive statistics
| Standard deviation | 0.02319201844 |
|---|---|
| Coefficient of variation (CV) | 2.007569618 |
| Kurtosis | 6.222613121 |
| Mean | 0.01155228602 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.470267201 |
| Sum | 5.845456727 |
| Variance | 0.0005378697192 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 321 | |
| 0.01932878081 | 5 | 0.5% |
| 0.04562604354 | 4 | 0.4% |
| 0.00115377722 | 3 | 0.3% |
| 0.039871799 | 3 | 0.3% |
| 0.02077208207 | 3 | 0.3% |
| 0.004941278682 | 2 | 0.2% |
| 0.04487867115 | 2 | 0.2% |
| 0.05643579643 | 2 | 0.2% |
| 0.01488975214 | 2 | 0.2% |
| Other values (144) | 159 | 15.9% |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 321 | |
| 3.268614761 × 10⁻⁵ | 1 | 0.1% |
| 4.611411707 × 10⁻⁵ | 1 | 0.1% |
| 8.449957979 × 10⁻⁵ | 1 | 0.1% |
| 0.0001552295097 | 1 | 0.1% |
| 0.0001992428771 | 1 | 0.1% |
| 0.0003123945908 | 1 | 0.1% |
| 0.0003175720135 | 2 | 0.2% |
| 0.0006874971274 | 1 | 0.1% |
| 0.0007579828063 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.1282648481 | 1 | |
| 0.1168707089 | 2 | |
| 0.1089460049 | 1 | |
| 0.108805809 | 1 | |
| 0.1087460215 | 1 | |
| 0.100780341 | 2 | |
| 0.1004778421 | 1 | |
| 0.09917740412 | 1 | |
| 0.09453861262 | 1 | |
| 0.09211155247 | 1 |
NoisePollutionRailwayM
| Distinct | 117 |
|---|---|
| Distinct (%) | 23.1% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.01085048205 |
| Minimum | 0 |
|---|---|
| Maximum | 0.1793079249 |
| Zeros | 373 |
| Zeros (%) | 37.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 8.26512962 × 10⁻⁵ |
| 95-th percentile | 0.07467264392 |
| Maximum | 0.1793079249 |
| Range | 0.1793079249 |
| Interquartile range (IQR) | 8.26512962 × 10⁻⁵ |
Descriptive statistics
| Standard deviation | 0.02972206689 |
|---|---|
| Coefficient of variation (CV) | 2.739239303 |
| Kurtosis | 10.72970499 |
| Mean | 0.01085048205 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 3.291599476 |
| Sum | 5.490343919 |
| Variance | 0.0008834012605 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 373 | |
| 0.02060273537 | 4 | 0.4% |
| 0.0005495015788 | 3 | 0.3% |
| 0.03541653254 | 2 | 0.2% |
| 0.004619852165 | 2 | 0.2% |
| 0.06663957741 | 2 | 0.2% |
| 0.1348221032 | 2 | 0.2% |
| 0.03387398426 | 2 | 0.2% |
| 0.009200991609 | 2 | 0.2% |
| 0.001221348246 | 2 | 0.2% |
| Other values (107) | 112 | 11.2% |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 373 | |
| 4.619364375 × 10⁻⁵ | 1 | 0.1% |
| 6.1447708 × 10⁻⁵ | 1 | 0.1% |
| 6.161049843 × 10⁻⁵ | 1 | 0.1% |
| 7.728930934 × 10⁻⁵ | 1 | 0.1% |
| 7.85274532 × 10⁻⁵ | 2 | 0.2% |
| 8.402591054 × 10⁻⁵ | 1 | 0.1% |
| 0.0001069159335 | 2 | 0.2% |
| 0.0001193583296 | 1 | 0.1% |
| 0.0001939548163 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.1793079249 | 1 | |
| 0.1654086827 | 1 | |
| 0.1556886228 | 1 | |
| 0.1453238147 | 1 | |
| 0.1431509421 | 1 | |
| 0.1398084531 | 1 | |
| 0.1387629758 | 1 | |
| 0.1348221032 | 2 | |
| 0.1295575131 | 1 | |
| 0.1288569425 | 1 |
NoisePollutionRailwayS
| Distinct | 62 |
|---|---|
| Distinct (%) | 12.3% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.008839980359 |
| Minimum | 0 |
|---|---|
| Maximum | 0.2987745098 |
| Zeros | 437 |
| Zeros (%) | 43.7% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 0 |
| 95-th percentile | 0.04748022101 |
| Maximum | 0.2987745098 |
| Range | 0.2987745098 |
| Interquartile range (IQR) | 0 |
Descriptive statistics
| Standard deviation | 0.03638141229 |
|---|---|
| Coefficient of variation (CV) | 4.115553521 |
| Kurtosis | 29.56983729 |
| Mean | 0.008839980359 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 5.23863082 |
| Sum | 4.473030062 |
| Variance | 0.00132360716 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 437 | |
| 0.02737187632 | 2 | 0.2% |
| 0.04132762313 | 2 | 0.2% |
| 0.0006045949214 | 2 | 0.2% |
| 0.01239067055 | 2 | 0.2% |
| 0.001018329939 | 2 | 0.2% |
| 0.2987745098 | 2 | 0.2% |
| 0.02153518124 | 2 | 0.2% |
| 0.02395276292 | 2 | 0.2% |
| 0.1194029851 | 1 | 0.1% |
| Other values (52) | 52 | 5.2% |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 437 | |
| 5.006007209 × 10⁻⁵ | 1 | 0.1% |
| 0.0002006420546 | 1 | 0.1% |
| 0.0002008838891 | 1 | 0.1% |
| 0.0002016129032 | 1 | 0.1% |
| 0.0003018108652 | 1 | 0.1% |
| 0.0006045949214 | 2 | 0.2% |
| 0.000702811245 | 1 | 0.1% |
| 0.000860672337 | 1 | 0.1% |
| 0.001018329939 | 2 | 0.2% |
| Value | Count | Frequency (%) |
| 0.2987745098 | 2 | |
| 0.2456785346 | 1 | |
| 0.2232097187 | 1 | |
| 0.2112608932 | 1 | |
| 0.2010171306 | 1 | |
| 0.1892801657 | 1 | |
| 0.1820264766 | 1 | |
| 0.179613936 | 1 | |
| 0.1724318658 | 1 | |
| 0.1682409225 | 1 |
NoisePollutionRoadL
| Distinct | 411 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2256832477 |
| Minimum | 0 |
|---|---|
| Maximum | 0.6389892784 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04701504348 |
| Q1 | 0.1320421044 |
| median | 0.2109647519 |
| Q3 | 0.3036385704 |
| 95-th percentile | 0.4709108442 |
| Maximum | 0.6389892784 |
| Range | 0.6389892784 |
| Interquartile range (IQR) | 0.171596466 |
Descriptive statistics
| Standard deviation | 0.1273428966 |
|---|---|
| Coefficient of variation (CV) | 0.5642549809 |
| Kurtosis | 0.03004224388 |
| Mean | 0.2256832477 |
| Median Absolute Deviation (MAD) | 0.08331380384 |
| Skewness | 0.653235523 |
| Sum | 114.1957233 |
| Variance | 0.01621621332 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.04701504348 | 7 | 0.7% |
| 0.5066819997 | 5 | 0.5% |
| 0.1496147448 | 4 | 0.4% |
| 0.2248514537 | 4 | 0.4% |
| 0.06140974754 | 4 | 0.4% |
| 0.1307240185 | 3 | 0.3% |
| 0.2737136762 | 3 | 0.3% |
| 0.3232975604 | 3 | 0.3% |
| 0.1612454079 | 3 | 0.3% |
| 0.6389892784 | 3 | 0.3% |
| Other values (401) | 467 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.006997403447 | 1 | |
| 0.007135999625 | 1 | |
| 0.01611291057 | 1 | |
| 0.02401064667 | 1 | |
| 0.02465617233 | 1 | |
| 0.02551020408 | 1 | |
| 0.02565586003 | 1 | |
| 0.02667196782 | 1 | |
| 0.03106093239 | 2 |
| Value | Count | Frequency (%) |
| 0.6389892784 | 3 | |
| 0.5435669138 | 1 | 0.1% |
| 0.5377029475 | 1 | 0.1% |
| 0.5347788582 | 1 | 0.1% |
| 0.5341028354 | 1 | 0.1% |
| 0.5112167319 | 1 | 0.1% |
| 0.5103918175 | 1 | 0.1% |
| 0.5094313976 | 1 | 0.1% |
| 0.5073983323 | 3 | |
| 0.5066819997 | 5 |
NoisePollutionRoadM
| Distinct | 411 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2502618431 |
| Minimum | 0 |
|---|---|
| Maximum | 0.6164580887 |
| Zeros | 1 |
| Zeros (%) | 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04905241535 |
| Q1 | 0.1476138402 |
| median | 0.2484047449 |
| Q3 | 0.3433173736 |
| 95-th percentile | 0.4836870039 |
| Maximum | 0.6164580887 |
| Range | 0.6164580887 |
| Interquartile range (IQR) | 0.1957035334 |
Descriptive statistics
| Standard deviation | 0.1288296489 |
|---|---|
| Coefficient of variation (CV) | 0.5147794299 |
| Kurtosis | -0.6397234614 |
| Mean | 0.2502618431 |
| Median Absolute Deviation (MAD) | 0.09687805643 |
| Skewness | 0.1620821236 |
| Sum | 126.6324926 |
| Variance | 0.01659707844 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.04905241535 | 7 | 0.7% |
| 0.4839229697 | 5 | 0.5% |
| 0.2314881042 | 4 | 0.4% |
| 0.3271626601 | 4 | 0.4% |
| 0.06543397192 | 4 | 0.4% |
| 0.2207728006 | 3 | 0.3% |
| 0.402397115 | 3 | 0.3% |
| 0.3151997125 | 3 | 0.3% |
| 0.2403281918 | 3 | 0.3% |
| 0.488606934 | 3 | 0.3% |
| Other values (401) | 467 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 1 | |
| 0.006943998974 | 1 | |
| 0.007639064077 | 2 | |
| 0.007958528037 | 1 | |
| 0.01154838078 | 1 | |
| 0.01490223739 | 1 | |
| 0.01658760177 | 1 | |
| 0.01853273654 | 1 | |
| 0.02155172414 | 1 | |
| 0.02312905625 | 1 |
| Value | Count | Frequency (%) |
| 0.6164580887 | 1 | |
| 0.5961194087 | 1 | |
| 0.58900187 | 1 | |
| 0.5254700825 | 1 | |
| 0.519451656 | 1 | |
| 0.516381358 | 1 | |
| 0.5155046079 | 1 | |
| 0.5131660471 | 1 | |
| 0.5013989467 | 2 | |
| 0.4960551719 | 2 |
NoisePollutionRoadS
| Distinct | 407 |
|---|---|
| Distinct (%) | 80.4% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2784532628 |
| Minimum | 0 |
|---|---|
| Maximum | 0.6861177525 |
| Zeros | 6 |
| Zeros (%) | 0.6% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.04170959191 |
| Q1 | 0.1573945616 |
| median | 0.2817924959 |
| Q3 | 0.3964871367 |
| 95-th percentile | 0.5150859135 |
| Maximum | 0.6861177525 |
| Range | 0.6861177525 |
| Interquartile range (IQR) | 0.2390925751 |
Descriptive statistics
| Standard deviation | 0.1521066892 |
|---|---|
| Coefficient of variation (CV) | 0.5462557261 |
| Kurtosis | -0.5762569542 |
| Mean | 0.2784532628 |
| Median Absolute Deviation (MAD) | 0.1204538953 |
| Skewness | 0.1908166936 |
| Sum | 140.897351 |
| Variance | 0.02313644491 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.09710078233 | 7 | 0.7% |
| 0 | 6 | 0.6% |
| 0.6861177525 | 5 | 0.5% |
| 0.2943943299 | 4 | 0.4% |
| 0.09106305367 | 4 | 0.4% |
| 0.3090667454 | 4 | 0.4% |
| 0.1063423332 | 3 | 0.3% |
| 0.4433722322 | 3 | 0.3% |
| 0.2807410423 | 3 | 0.3% |
| 0.1152146465 | 3 | 0.3% |
| Other values (397) | 464 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0 | 6 | |
| 0.0005070993915 | 1 | 0.1% |
| 0.001526872964 | 1 | 0.1% |
| 0.002092050209 | 1 | 0.1% |
| 0.002295918367 | 2 | 0.2% |
| 0.009830866808 | 1 | 0.1% |
| 0.01236690328 | 1 | 0.1% |
| 0.01635514019 | 1 | 0.1% |
| 0.01680765806 | 1 | 0.1% |
| 0.02122940431 | 1 | 0.1% |
| Value | Count | Frequency (%) |
| 0.6861177525 | 5 | |
| 0.6723805855 | 1 | 0.1% |
| 0.6452469464 | 1 | 0.1% |
| 0.6182753165 | 1 | 0.1% |
| 0.6072145726 | 1 | 0.1% |
| 0.5980448534 | 2 | 0.2% |
| 0.5901047565 | 2 | 0.2% |
| 0.5825529818 | 1 | 0.1% |
| 0.5778199566 | 1 | 0.1% |
| 0.5495585875 | 1 | 0.1% |
Plot_area
| Distinct | 109 |
|---|---|
| Distinct (%) | 93.2% |
| Missing | 883 |
| Missing (%) | 88.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 893.1196581 |
| Minimum | 30 |
|---|---|
| Maximum | 9000 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 30 |
|---|---|
| 5-th percentile | 89.8 |
| Q1 | 342 |
| median | 619 |
| Q3 | 1000 |
| 95-th percentile | 1917.8 |
| Maximum | 9000 |
| Range | 8970 |
| Interquartile range (IQR) | 658 |
Descriptive statistics
| Standard deviation | 1203.536872 |
|---|---|
| Coefficient of variation (CV) | 1.34756509 |
| Kurtosis | 24.29570627 |
| Mean | 893.1196581 |
| Median Absolute Deviation (MAD) | 329 |
| Skewness | 4.516865967 |
| Sum | 104495 |
| Variance | 1448501.003 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1200 | 2 | 0.2% |
| 417 | 2 | 0.2% |
| 400 | 2 | 0.2% |
| 500 | 2 | 0.2% |
| 360 | 2 | 0.2% |
| 699 | 2 | 0.2% |
| 403 | 2 | 0.2% |
| 1000 | 2 | 0.2% |
| 840 | 1 | 0.1% |
| 656 | 1 | 0.1% |
| Other values (99) | 99 | 9.9% |
| (Missing) | 883 |
| Value | Count | Frequency (%) |
| 30 | 1 | |
| 52 | 1 | |
| 54 | 1 | |
| 60 | 1 | |
| 70 | 1 | |
| 85 | 1 | |
| 91 | 1 | |
| 93 | 1 | |
| 100 | 1 | |
| 124 | 1 |
| Value | Count | Frequency (%) |
| 9000 | 1 | |
| 6754 | 1 | |
| 6110 | 1 | |
| 3564 | 1 | |
| 3000 | 1 | |
| 2237 | 1 | |
| 1838 | 1 | |
| 1819 | 1 | |
| 1800 | 1 | |
| 1789 | 1 |
PopulationDensityL
| Distinct | 411 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.1606907277 |
| Minimum | 0.002524030382 |
|---|---|
| Maximum | 0.6875655101 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0.002524030382 |
|---|---|
| 5-th percentile | 0.01220132729 |
| Q1 | 0.05528064102 |
| median | 0.1263144656 |
| Q3 | 0.2266858282 |
| 95-th percentile | 0.4453161186 |
| Maximum | 0.6875655101 |
| Range | 0.6850414797 |
| Interquartile range (IQR) | 0.1714051872 |
Descriptive statistics
| Standard deviation | 0.1371583185 |
|---|---|
| Coefficient of variation (CV) | 0.8535546539 |
| Kurtosis | 1.462343308 |
| Mean | 0.1606907277 |
| Median Absolute Deviation (MAD) | 0.07729745812 |
| Skewness | 1.304317045 |
| Sum | 81.30950824 |
| Variance | 0.01881240434 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.009790557138 | 7 | 0.7% |
| 0.5910298941 | 5 | 0.5% |
| 0.02945662323 | 4 | 0.4% |
| 0.3154787289 | 4 | 0.4% |
| 0.05528064102 | 4 | 0.4% |
| 0.1174574682 | 3 | 0.3% |
| 0.3672270775 | 3 | 0.3% |
| 0.2426281534 | 3 | 0.3% |
| 0.1478324418 | 3 | 0.3% |
| 0.3973680009 | 3 | 0.3% |
| Other values (401) | 467 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0.002524030382 | 1 | |
| 0.002526073936 | 2 | |
| 0.003081595886 | 2 | |
| 0.00480647159 | 1 | |
| 0.005064589232 | 1 | |
| 0.006554732407 | 1 | |
| 0.006901511382 | 1 | |
| 0.008104293296 | 1 | |
| 0.009167287958 | 2 | |
| 0.009777679344 | 1 |
| Value | Count | Frequency (%) |
| 0.6875655101 | 1 | 0.1% |
| 0.6308562288 | 1 | 0.1% |
| 0.629389887 | 1 | 0.1% |
| 0.5984419421 | 1 | 0.1% |
| 0.5910298941 | 5 | |
| 0.5770003856 | 1 | 0.1% |
| 0.5699265114 | 1 | 0.1% |
| 0.5699114807 | 1 | 0.1% |
| 0.5604369286 | 1 | 0.1% |
| 0.5344437723 | 1 | 0.1% |
PopulationDensityM
| Distinct | 411 |
|---|---|
| Distinct (%) | 81.2% |
| Missing | 494 |
| Missing (%) | 49.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 0.2298702705 |
| Minimum | 0.001466111698 |
|---|---|
| Maximum | 0.9239637627 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 0.001466111698 |
|---|---|
| 5-th percentile | 0.01875199438 |
| Q1 | 0.1056895788 |
| median | 0.1894008618 |
| Q3 | 0.3294739982 |
| 95-th percentile | 0.5356565985 |
| Maximum | 0.9239637627 |
| Range | 0.922497651 |
| Interquartile range (IQR) | 0.2237844194 |
Descriptive statistics
| Standard deviation | 0.1622518477 |
|---|---|
| Coefficient of variation (CV) | 0.7058409393 |
| Kurtosis | 0.5054301827 |
| Mean | 0.2298702705 |
| Median Absolute Deviation (MAD) | 0.1027711536 |
| Skewness | 0.8968429926 |
| Sum | 116.3143569 |
| Variance | 0.02632566208 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.01630176353 | 7 | 0.7% |
| 0.4730860222 | 5 | 0.5% |
| 0.06866297921 | 4 | 0.4% |
| 0.4591364746 | 4 | 0.4% |
| 0.151852859 | 4 | 0.4% |
| 0.1302732712 | 3 | 0.3% |
| 0.5356565985 | 3 | 0.3% |
| 0.2716471825 | 3 | 0.3% |
| 0.2323960079 | 3 | 0.3% |
| 0.5635840506 | 3 | 0.3% |
| Other values (401) | 467 | |
| (Missing) | 494 |
| Value | Count | Frequency (%) |
| 0.001466111698 | 2 | |
| 0.004769999161 | 1 | |
| 0.0071651781 | 1 | |
| 0.007725378074 | 2 | |
| 0.007982795565 | 2 | |
| 0.008089177326 | 1 | |
| 0.009293047474 | 1 | |
| 0.009569669956 | 1 | |
| 0.01266378531 | 1 | |
| 0.01293405526 | 1 |
| Value | Count | Frequency (%) |
| 0.9239637627 | 1 | |
| 0.7773642217 | 1 | |
| 0.7370275748 | 1 | |
| 0.7130785631 | 1 | |
| 0.706068619 | 1 | |
| 0.6968112052 | 1 | |
| 0.6949685332 | 1 | |
| 0.6277301556 | 1 | |
| 0.6025817198 | 1 | |
| 0.5988690552 | 1 |
living_area
| Distinct | 196 |
|---|---|
| Distinct (%) | 40.6% |
| Missing | 517 |
| Missing (%) | 51.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 151.757764 |
| Minimum | 25 |
|---|---|
| Maximum | 700 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 25 |
|---|---|
| 5-th percentile | 53.1 |
| Q1 | 95.5 |
| median | 130 |
| Q3 | 181 |
| 95-th percentile | 319.7 |
| Maximum | 700 |
| Range | 675 |
| Interquartile range (IQR) | 85.5 |
Descriptive statistics
| Standard deviation | 89.96134092 |
|---|---|
| Coefficient of variation (CV) | 0.5927956406 |
| Kurtosis | 7.13860986 |
| Mean | 151.757764 |
| Median Absolute Deviation (MAD) | 40 |
| Skewness | 2.172213113 |
| Sum | 73299 |
| Variance | 8093.04286 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 160 | 17 | 1.7% |
| 130 | 12 | 1.2% |
| 120 | 10 | 1.0% |
| 100 | 9 | 0.9% |
| 300 | 9 | 0.9% |
| 90 | 8 | 0.8% |
| 140 | 8 | 0.8% |
| 97 | 7 | 0.7% |
| 180 | 7 | 0.7% |
| 110 | 6 | 0.6% |
| Other values (186) | 390 | |
| (Missing) | 517 |
| Value | Count | Frequency (%) |
| 25 | 1 | 0.1% |
| 35 | 1 | 0.1% |
| 37 | 1 | 0.1% |
| 38 | 1 | 0.1% |
| 39 | 1 | 0.1% |
| 40 | 4 | |
| 43 | 1 | 0.1% |
| 46 | 1 | 0.1% |
| 48 | 2 | |
| 49 | 2 |
| Value | Count | Frequency (%) |
| 700 | 1 | |
| 600 | 1 | |
| 583 | 1 | |
| 560 | 1 | |
| 550 | 1 | |
| 509 | 1 | |
| 500 | 1 | |
| 450 | 1 | |
| 430 | 1 | |
| 425 | 1 |
rooms_combined
| Distinct | 25 |
|---|---|
| Distinct (%) | 5.1% |
| Missing | 508 |
| Missing (%) | 50.8% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.960365854 |
| Minimum | 1 |
|---|---|
| Maximum | 18 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 7.9 KiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 2.5 |
| Q1 | 3.5 |
| median | 4.5 |
| Q3 | 5.5 |
| 95-th percentile | 8 |
| Maximum | 18 |
| Range | 17 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.980802535 |
|---|---|
| Coefficient of variation (CV) | 0.3993258953 |
| Kurtosis | 6.361611852 |
| Mean | 4.960365854 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 1.560790132 |
| Sum | 2440.5 |
| Variance | 3.923578685 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 4.5 | 127 | 12.7% |
| 5.5 | 84 | 8.4% |
| 3.5 | 71 | 7.1% |
| 6.5 | 39 | 3.9% |
| 2.5 | 38 | 3.8% |
| 6 | 21 | 2.1% |
| 7.5 | 14 | 1.4% |
| 5 | 14 | 1.4% |
| 7 | 13 | 1.3% |
| 8 | 10 | 1.0% |
| Other values (15) | 61 | 6.1% |
| (Missing) | 508 | 50.8% |
| Value | Count | Frequency (%) |
| 1 | 5 | 0.5% |
| 1.5 | 7 | 0.7% |
| 2 | 8 | 0.8% |
| 2.5 | 38 | 3.8% |
| 3 | 10 | 1.0% |
| 3.5 | 71 | 7.1% |
| 4 | 8 | 0.8% |
| 4.5 | 127 | 12.7% |
| 5 | 14 | 1.4% |
| 5.5 | 84 | 8.4% |
| Value | Count | Frequency (%) |
| 18 | 1 | 0.1% |
| 16 | 1 | 0.1% |
| 13 | 1 | 0.1% |
| 12.5 | 1 | 0.1% |
| 11 | 5 | 0.5% |
| 10.5 | 1 | 0.1% |
| 10 | 3 | 0.3% |
| 9.5 | 3 | 0.3% |
| 9 | 3 | 0.3% |
| 8.5 | 4 | 0.4% |
Spearman's ρ
Spearman's rank correlation coefficient (ρ) measures monotonic correlation between two variables, so it captures nonlinear monotonic relationships better than Pearson's r. Its value lies between -1 and +1: -1 indicates total negative monotonic correlation, 0 indicates no monotonic correlation, and +1 indicates total positive monotonic correlation. To calculate ρ for two variables X and Y, one divides the covariance of the rank variables of X and Y by the product of their standard deviations.
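The calculation described above can be sketched in plain Python. This is a minimal illustration, not the code the report generator runs: the helper names `average_ranks`, `pearson_r`, and `spearman_rho` are hypothetical, ties are assumed to receive the mean of their rank positions, and the sample pairs are the eight rows of the first and last rows shown in this report that have both `living_area` and `rooms_combined`.

```python
from math import sqrt


def average_ranks(values):
    """Return 1-based ranks; tied values share the mean of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to the end of the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        shared_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = shared_rank
        i = j + 1
    return ranks


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


def spearman_rho(x, y):
    """Spearman's rho: Pearson's r applied to the rank variables."""
    return pearson_r(average_ranks(x), average_ranks(y))


# Complete (living_area, rooms_combined) pairs from the sample rows of this report.
living_area = [119.0, 116.0, 130.0, 80.0, 247.0, 89.0, 90.0, 163.0]
rooms_combined = [5.5, 4.5, 4.5, 3.0, 6.5, 3.5, 3.5, 5.5]

print(spearman_rho(living_area, rooms_combined))
```

Because only ranks enter the calculation, any strictly increasing transformation of either variable leaves ρ unchanged, which is why ρ reaches 1 for a nonlinear but monotonic relationship.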
Pearson's r
Pearson's correlation coefficient (r) measures linear correlation between two variables. Its value lies between -1 and +1: -1 indicates total negative linear correlation, 0 indicates no linear correlation, and +1 indicates total positive linear correlation. Furthermore, r is invariant under separate changes in location and scale of the two variables, implying that for a linear function the angle to the x-axis does not affect r. To calculate r for two variables X and Y, one divides the covariance of X and Y by the product of their standard deviations.
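As a small illustration of the formula and of the invariance property, the sketch below computes r directly from the covariance and standard deviations. The function name `pearson_r` is hypothetical, the sample pairs are the complete `living_area` and `rooms_combined` values from the sample rows of this report, and the shift and scale constants are arbitrary choices. The invariance shown assumes positive scale factors; a negative factor on one variable would flip the sign of r.

```python
from math import sqrt


def pearson_r(x, y):
    """Covariance of x and y divided by the product of their standard deviations."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y)) / (n - 1)
    std_x = sqrt(sum((a - mean_x) ** 2 for a in x) / (n - 1))
    std_y = sqrt(sum((b - mean_y) ** 2 for b in y) / (n - 1))
    return cov / (std_x * std_y)


# Complete (living_area, rooms_combined) pairs from the sample rows of this report.
living_area = [119.0, 116.0, 130.0, 80.0, 247.0, 89.0, 90.0, 163.0]
rooms_combined = [5.5, 4.5, 4.5, 3.0, 6.5, 3.5, 3.5, 5.5]

r = pearson_r(living_area, rooms_combined)

# Shift and rescale each variable separately (arbitrary positive scale factors).
shifted_area = [2.0 * a + 10.0 for a in living_area]
rescaled_rooms = [0.5 * b - 1.0 for b in rooms_combined]
r_transformed = pearson_r(shifted_area, rescaled_rooms)

# The two values agree up to floating-point rounding.
print(r, r_transformed)
```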
Kendall's τ
Like Spearman's rank correlation coefficient, the Kendall rank correlation coefficient (τ) measures ordinal association between two variables. Its value lies between -1 and +1: -1 indicates total negative correlation, 0 indicates no correlation, and +1 indicates total positive correlation. To calculate τ for two variables X and Y, one determines the number of concordant and discordant pairs of observations. τ is the number of concordant pairs minus the number of discordant pairs, divided by the total number of pairs.
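The pair-counting definition above translates almost directly into code. The sketch below implements exactly that formula, often called τ-a; the function name `kendall_tau_a` is hypothetical. Note the assumption: tied pairs count as neither concordant nor discordant here, whereas statistical libraries commonly apply a tie-corrected variant (τ-b), so results can differ on data with many ties, such as `rooms_combined`.

```python
from itertools import combinations


def kendall_tau_a(x, y):
    """(concordant - discordant) / total pairs, with no correction for ties."""
    pairs = list(combinations(range(len(x)), 2))
    concordant = 0
    discordant = 0
    for i, j in pairs:
        # Positive product: both variables move in the same direction.
        direction = (x[i] - x[j]) * (y[i] - y[j])
        if direction > 0:
            concordant += 1
        elif direction < 0:
            discordant += 1
        # direction == 0 means a tie in x or y; it adds to neither count.
    return (concordant - discordant) / len(pairs)


# Complete (living_area, rooms_combined) pairs from the sample rows of this report.
living_area = [119.0, 116.0, 130.0, 80.0, 247.0, 89.0, 90.0, 163.0]
rooms_combined = [5.5, 4.5, 4.5, 3.0, 6.5, 3.5, 3.5, 5.5]

print(kendall_tau_a(living_area, rooms_combined))
```

This direct version compares every pair, so its cost grows quadratically with the number of observations; that is acceptable for a 1000-row sample but not for large datasets.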
Phik (φk)
Phik (φk) is a new and practical correlation coefficient that works consistently across categorical, ordinal and interval variables, captures non-linear dependency, and reverts to the Pearson correlation coefficient in the case of a bivariate normal input distribution. Extensive documentation is available.
First rows
| | df_index | 0 | Floor | ForestDensityL | ForestDensityM | ForestDensityS | NoisePollutionRailwayL | NoisePollutionRailwayM | NoisePollutionRailwayS | NoisePollutionRoadL | NoisePollutionRoadM | NoisePollutionRoadS | Plot_area | PopulationDensityL | PopulationDensityM | living_area | rooms_combined |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 3564 | NaN | 1.0 | 0.275512 | 0.156081 | 0.210352 | 0.000000 | 0.000000 | 0.0 | 0.131270 | 0.143057 | 0.123900 | NaN | 0.019771 | 0.039614 | 119.0 | 5.5 |
| 1 | 801 | NaN | NaN | 0.223202 | 0.128919 | 0.010165 | 0.000000 | 0.000000 | 0.0 | 0.283924 | 0.354313 | 0.434566 | NaN | 0.093578 | 0.185027 | 116.0 | 4.5 |
| 2 | 19476 | NaN | 0.0 | 0.110777 | 0.040778 | 0.000000 | 0.000000 | 0.000000 | 0.0 | 0.170125 | 0.171316 | 0.094135 | NaN | 0.089755 | 0.196065 | 130.0 | 4.5 |
| 3 | 2340 | 2991000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 4 | 15230 | NaN | NaN | 0.243851 | 0.098060 | 0.000000 | 0.000000 | 0.000000 | 0.0 | 0.132348 | 0.231978 | 0.366476 | NaN | 0.021085 | 0.037812 | 80.0 | 3.0 |
| 5 | 7959 | 690000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 6 | 20468 | 2800000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 7 | 2132 | NaN | NaN | 0.411550 | 0.394017 | 0.145943 | 0.000000 | 0.000000 | 0.0 | 0.044076 | 0.066994 | 0.089483 | NaN | 0.006555 | 0.009570 | NaN | 4.5 |
| 8 | 3921 | 790000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 9 | 15301 | NaN | NaN | 0.004876 | 0.013387 | 0.000000 | 0.058849 | 0.023703 | 0.0 | 0.310368 | 0.335688 | 0.189308 | NaN | 0.493892 | 0.556291 | 247.0 | 6.5 |
Last rows
| | df_index | 0 | Floor | ForestDensityL | ForestDensityM | ForestDensityS | NoisePollutionRailwayL | NoisePollutionRailwayM | NoisePollutionRailwayS | NoisePollutionRoadL | NoisePollutionRoadM | NoisePollutionRoadS | Plot_area | PopulationDensityL | PopulationDensityM | living_area | rooms_combined |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 990 | 17160 | 490000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 991 | 7145 | 790000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 992 | 10267 | 1550000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 993 | 20917 | 215000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 994 | 20046 | 1390000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 995 | 16088 | NaN | 0.0 | 0.175336 | 0.077238 | 0.000000 | 0.0 | 0.0 | 0.0 | 0.121365 | 0.159631 | 0.245363 | NaN | 0.047252 | 0.092649 | 89.0 | 3.5 |
| 996 | 7722 | 440000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 997 | 9383 | NaN | NaN | 0.620170 | 0.364002 | 0.000967 | 0.0 | 0.0 | 0.0 | 0.061410 | 0.065434 | 0.091063 | 30.0 | 0.055281 | 0.151853 | 90.0 | 3.5 |
| 998 | 20457 | 443000.0 | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN | NaN |
| 999 | 12531 | NaN | 2.0 | 0.600831 | 0.496903 | 0.489897 | 0.0 | 0.0 | 0.0 | 0.259858 | 0.383954 | 0.464767 | NaN | 0.031798 | 0.092418 | 163.0 | 5.5 |